Fault-Tolerant Data Transfer in a Multiprocessor System by Forward and Backward Hardware Error Recovery

نویسندگان

  • Reinhard Männer
  • Oliver Stucky
چکیده

A MIMD processor system is described, which allows forward and backward hardware error recovery (FHER and BHER) for data transfers. Fault-tolerant operation is provided by a multiple bus system and distributed error correction controller (ECC) hardware. Addresses and data are transferred through the bus system together with an error correction code, which is generated and checked by the ECC hardware. FHER is possible for all single-bit errors. Overlapping bus protocols allow these distributed ECC operations to be executed and synchronised concurrently with the transfer. Double-bit and multi-bit transfer errors as well as arbitration errors in the multiple bus system are detected and corrected by automatic rerouteing of the transfer path (BHER). These features improve the reliability of large multiprocessor systems significantly.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Comparative Study of VHDL Implementation of FT-2D-cGA and FT-3D-cGA on Different Benchmarks (RESEARCH NOTE)

This paper presents the VHDL implementation of fault tolerant cellular genetic algorithm. The goal of paper is to harden the hardware implementation of the cGA against single error upset (SEU), when affecting the fitness registers in the target hardware. The proposed approach, consists of two phases; Error monitoring and error recovery. Using innovative connectivity between processing elements ...

متن کامل

Fault-Tolerant Distributed-Shared-Memory on a Broadcast-Based Interconnection Network

The Simultaneous Optical Multiprocessor Exchange Bus (SOME-Bus) is a low-latency, high-bandwidth interconnection network which directly links arbitrary pairs of processor nodes without contention, and can efficiently interconnect over one hundred nodes. Each node has a dedicated output channel and an array of receivers, with one receiver dedicated to every other node’s output channel. The SOME-...

متن کامل

Backward Error Recovery in Redundant Disk Arrays

Redundant disk arrays are single fault tolerant, incorporating a layer of error handling not found in nonredundant disk systems. Recovery from these errors is complex, due in part to the large number of erroneous states the system may reach. The established approach to error recovery in disk systems is to transition directly from an erroneous state to completion. This technique, known as forwar...

متن کامل

A Microprocessor-Based Hybrid Duplex Fault-Tolerant System

Reliability is one of the fundamental considerations in the design of industrial control equipment. The microprocessor-based Hybrid Duplex fault-tolerant System (HDS) proposed in this paper has high reliability to meet this demand although its hardware structure is simple. The hardware configuration of HDS and the fault tolerance of this system are described. The switching control strategies in...

متن کامل

Adaptable Fault Tolerance Configurations for Multiprocessor Systems

The escalating increase in the complexity of multiprocessor systems increases the probability of faults occurring in these systems As a consequence there is a great need for achieving fault-tolerance of processing in multiprocessor systems. Faulttolerance generally requires some forms of hardware and/or time redundancy. Two fault tolerant configurations are proposed for both single and double t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Comput. J.

دوره 35  شماره 

صفحات  -

تاریخ انتشار 1992